Mon. Not. R. Astron. Soc. 000, 000-000 (0000) Printed 27 April 2010 (MN KTeX style file v2.2) 



A comparison of frequentist and Bayesian inference: Searching for 
low-frequency p modes and g modes in Sun-as-a-star data 



o 

Ph. 

6 



> 

o 
in 

o 
o 



A. M. Broomhall^*, W. J. Chaplin^ Y. Elsworth\ T. Appourchaux^, R. New^ 

^School of Physics and Astronomy, University of Birmingham, Edgbaston, Birmingham B15 2TT 
^Institut d'Astrophysique Spatiale, CNRS-Universite Paris XI UMR 8617, 91405 Orsay Cedex, France 
^Faculty of Arts, Computing, Engineering and Sciences, Sheffield Hallam University, Sheffield SI IWB 



27 April 2010 



ABSTRACT 

We describe and use two different statistical approaches to try and detect low-frequency 
solar oscillations in Sun-as-a-star data: a frequentist approach and a Bayesian approach. We 
have used frequentist statistics to search contemporaneous Sun-as-a-star data for coincident, 
statistically-prominent features. However, we find that this approach leads to numerous false 
detections. We have also used Bayesian statistics to search for evidence of low-frequency p 
modes and g modes in Sun-as-a-star data. We describe how Bayesian statistics can be used to 
search near-contemporaneous data for coincident prominent features. Near-contemporaneous 
data were used to circumvent the difficulties in deriving probabilities that occur when common 
noise is present in the data. We find that the Bayesian approach, which is reliant on the 
assumptions made when determining the posterior probability, leads to significantly fewer 
false detections and those that are observed can be discredited using a priori knowledge. 
Therefore, we have more confidence in the mode candidates found with Bayesian statistics. 

Key words: methods: data analysis, methods: statistical. Sun: helioseismology, Sun: oscilla- 
tions 



1 INTRODUCTION 

Low-frequency p modes are expected to have very long lifetimes 
and so, if detected, their frequencies can be determined both accu- 
rately and precisely. This, in turn, makes low-frequency p modes 
important for constraining models of the structure of the solar inte- 
rior. Only p modes with low degrees (low I) travel through regions 
close to the solar centre and so observations of low-/ p modes are 
important for inferring the structure of the solar core. Mode ampli- 
tudes decrease with frequency and this, coupled with the increasing 
level of noise at lower frequencies, means that below 1500yuHz 
oscillation signatures are difficult to detect. No independently con- 
firmed detections of the core-pe netrating, low-/ p m o des have been 
made be low ~ 970a^Hz (e .g. lOarcfa et all 1200 ll : IChaplin et all 
[2002; Broomhall et al.ll2007h . 

The large sound speed in the solar core means that the effect of 
core conditions on low-degree p modes is relatively small. On the 
other hand, solar gravity (g) modes, which have frequencies below 
450 /jHz, are confined to the radiative interior and core, making 
them an extremely sensitive probe of th e deep solar interior (see e.g. 
lOarci'a et al]|2008l : iMathur et alj|2008h . Since g modes are evanes- 
cent outside the radiative interior their signatures are significantly 
attenuated by the time they reach the photosphere, where helioseis- 
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mic observations are made. It is, therefore, a major observa tional 
challenge to observe these modes. lAppourchaux et al.l | |2010|) is an 
up-to-date review o f the current status of the search for g modes. 
lOarcia et al.l ( |2007|) claim to have found evidence of the signatures 
of g modes in data observed by the Global Oscillations at Low 
Frequencies (GOLF) i nstrument, onboard S olar and Heliospheric 
Observatory (SOHO). iGarcia et al.l (|20()8|) found similar results 
when they examin ed 20 per cent more GOLF data than used by 
iGarcia et al.l ( l2007h . However, these results await independent con- 
firmation and, to date, there have been no detections of individual 
g modes. 

S ome solar oscillations are thought to show a mixed character 
(e.g. IProvost. Berthomieu & Moreill2000l) . They have a sensitivity 
to the Sun's internal structure that looks like a g mode, and so 
have excellent diagnostic potential, but they respond to surface 
conditions in a manner that is similar to p modes. Therefore, the- 
oretically, mixed modes should be easier to detect than g modes, 
however, no mixed mo des have yet been de finitively identified. 
iGarcia et all l l200ll) and iGabriel et all | |2002|) tentatively identify 
one member of the « = 1, / = 1 p mode, which is thought to 
be mixed. However, they are unable to determine which azimuthal 
component is detected. 

Any signal from a low-frequency p mode (below ISOOyuHz) 
or g mode is expected to be small compared to the surrounding 
background noise and so the following question arises: How does 
an observer decide whether a detection is real or not? This is 
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where statistical testing becomes important as it allows an observer 
to quantify the significance of any potential detection. To date 
the majority of statistical-based searches for low-degree p modes 
and g modes have taken a frequentist approach, i.e. it is com- 
mon to determine the probability of observing a certain feature 
in a frequency-power spectrum based on the hypothesis that the 
spectrum contai ns pure noise. This is the HO hypothesis. However, 
recent work by lAppourchaux ^ implies that the 'P values', 

which are quoted to assess the significance of a detection in a fre- 
quentist approach, may be too optimistic. It is, therefore, useful to 
also determine the probability of observing a feature in a frequency- 
power spectrum based on the alternative HI hypothesis. The HI 
hypothesis assumes that there is a signal in the data. However, to 
calculate this probability is it necessary to make certain a priori as- 
su mptions about th e signal and so a Bayesian approach is appropri- 
ate. |A220urch^3 proposes that Bayesian statistics provides a more 
rigourous method of assessing the significance of any mode candi- 
dates than a frequentist approach. When using Bayesian statistics it 
is possible to use our knowledge of the structure of the Sun to put 
more constraints on the search for solar oscillations than when us- 
ing a frequentist approach. In this paper we use both frequentist and 
Bayesian statistics to search for low-frequency p modes, g modes 
and mixed modes. 

We have taken care to minimize the number of false detections. 
There are two ways in which false detections can be made. A type I 
error occurs when the null hypothesis is wrongly rejected. Here, 
that would mean wrongly rejecting the hypothesis that a prominent 
structure is due to noise. In other words a type I error would occur 
if we flagged a structure as a mode candidate when it is noise. A 
type II error occurs when the null hypothesis is not rejected when, 
in fact, it is false. In this paper that would mean accepting the 
hypothesis that a feature is due to noise when it is not. We regard 
type I errors as more serious than type II errors. Therefore we have 
decided to err on the side of caution and set very stringent require- 
ments that need to be passed before a structure is considered as a 
mode candidate. For example, we require that any mode candidates 
be close to the mode frequencies predicted by solar models. While 
this restriction may increase the number of type II errors it is also 
likely to restrict the number of type I errors, which we consider to 
be more important. 

The structure of this paper is as follows: we begin, in Section[2l 
with a comparison of the frequentist and Bayesian approaches. We 
then give a more detailed description of how frequentist statistics 
were used to search various data sets (Section O. We have com- 
pared contemporaneous data for coincident features that are promi- 
nent when compared to the background noise. When determining 
the probability of observing such features in contemporaneous data, 
it is necessary to take account of any common noise that is present 
in the data due to, for example , the solar granulation. In order to 
do this iBroomhall et alj ( |2007|) derived a joint probability based 
on the assumption that the background noise has a Gaussian dis- 
tribution, which is true in frequency-amplitude spectra. Therefore, 
when using the frequentist approach to search for low-frequency 
solar oscillations we have examined fvaqasncy -amplitude spectra. 

In Section |4] we give a brief introduction to Bayesian statistics 
and describe how they can be used to search for low-frequency 
oscillations in solar data. Bayesian statistics can be used to de- 
termine the 'posterior probability' that a hypothesis, such as HO, 
is true given a set of observed data. The derivation of the pos- 
terior probability is based on the assumption that the background 
noise has a^^, 2 degrees of freedom (d.o.f.) distribution, which is 
true in frequency-power spectra. Therefore, when using a Bayesian 



approach to search for low-frequency solar oscillations we have 
examined frequency-power spectra. In Section |5] we discuss one 
component of the Bayesian calculations: the prior probability. In 
particular we determine whether this probability can make use of 
a priori knowledge to guide searches for solar oscillations. To date 
Bayesian statistics have only been used to search individual data 
sets for evidence of oscillations. We have used a Bayesian approach 
to derive the posterior probability that a coincident prominent fea- 
ture will be detected in data observed by different instruments (Sec- 
tion |6j. 

We have searched three sets of data for evidence of low- 
frequency oscillations using both frequentist and Bayesian ap- 
proaches. The data were observed by the Birmingham Solar Oscil- 
lations Network (BiSON), and the GOLF and Michelson Doppler 
Imager (MDI) instruments onboard the SOHO spacecraft. The re- 
sults of searching the data are described in Section|7] A comparison 
of the results of the two different approaches is made in Section[8] 



2 A GENERAL COMPARISON BETWEEN 

FREQUENTIST AND BAYESL4N STATISTICS 

When using a frequentist approach to search for low-frequency so- 
lar oscillations the probability of observing, by chance, a prominent 
feature in a range of N frequency bins in a frequency-amplitude/ 
power spectrum is often determined. The determined probability is 
dependent on the value of A', which must be chosen carefully and 
the choice of A' is discussed in more detail in Section[3] 

When adopting a frequentist approach to statistics it is common 
to set, a priori, an arbitrary probability detection threshold level, for 
example, at 1 per cent. If the probability of observing a prominent 
feature is less than this threshold probability the feature is consid- 
ered as a mode candidate. However, the meaning of the threshold 
probability is commonly misunderstood. Take, for example, the 
frequentist approach that was used here. We determined the prob- 
ability of observing a prominent spike in a frequency-amplitude 
spectrum by chance, if the HO hypothesis is true. The HO hypothesis 
assumes that the data being examined contain only noise. However, 
the determined probability, or P value, is commonly misinterpreted 
as the probability that the HO hypothesis itself is true. In fact, we 
can say nothing about the validity of the hypothesis. 

The problem occurs because of the non-repeatability of the ex- 
periment: we only have one Sun to obser ve. A similar problem 
has been identifie d in medical statistics (e.g. lBerger&Sellkell987l: 
ISellke et al.l200lh . where it has been shown that the probability that 
the HO hypothesis is correct is significantly larger than the P value 
quoted in a frequentist approach. Consequently, great care must 
be taken when drawing conclusions from frequentist statistics. It 
must be made clear the frequentist threshold probability gives the 
chances of the data being observed if the underlying hypothesis 
is true and gives no information on the validity of the hypothesis 
itself. 

To determine the probability that a given hypothesis is true we 
must turn to Bayesian statistics. Bayesian statistics allow us to use a 
priori information in a statistically rigourous manner to compare 
the chances that, given the observed data, different hypotheses are 
true. For example, here we have used Bayesian statistics to deter- 
mine which of the HO and HI hypotheses are most likely, where the 
HI hypothesis assumes there is a signal in the data. It is important 
to point out that we are still not determining the probability that a 
detection is a solar mode of oscillation as the HI hypothesis does 
not specify the source of the signal. Any observed signal could con- 



Statistical searches for low -frequency solar oscillations 3 



ceivably be some sort of structured noise. However, the Bayesian 
approach still gives probabilities that are less open to misinter- 
pretation than the frequentist approach. Furthermore, significance 
estimates are much more conservative than those obtained with 
frequentist methods and so should lead to fewer false detections. 



3 SEARCHING CONTEMPORANEOUS DATA USING 
FREQUENTIST STATISTICAL TECHNIQUES 

The signal from solar oscillations will be common to contempo- 
raneous data observed by different instruments. It is, therefore, 
pertinent to search frequency-amplitude spectra, constructed from 
contemporaneous data, for statistically significant, prominent con- 
centrations of power, which lie significantly above the local noise 
background and that are coincident in frequency in different data 
sets. In Section|7]we show the results obtained when contempora- 
neous BiSON, GOLF and MDI data were searched. 

It is possible to determine the probability of observing, by 
chance, prominent features that are coincident in frequency in two 
frequency-amplitude spectra. Proper account must be taken of any 
noise that is common to the two sets of data since the presence of 
common noise increases the probability of observing a coincident 
prominent noise feature. When BiSON data are compared with 
GOLF or MDI data any common noise is solar in origin and, more 
specifically, is due to solar granulation. When GOLF and MDI data 
are compared some of the common noise may also be instrumental 
as both are onboard the SOHO spacecraft. The presence of common 
noise means that the probability of observing coincident prominent 
features by chance is non-trivial to derive. A derivation of the joint 
probability, p, of the occurrence of coincident pr ominent features 
in two frequency- amplitude spectra is given in iBroomhall et al.l 
( l2007l) . In order to make this derivation it was assumed that the 
background is slowly varying in frequency with no sharp features 
and that the real and imaginary noise in the frequency-amplitude 
spectrum has a Gaussian distribution, as is observed. 

The probability of observing, by chance, a coincident, prominent 
feature at least once over a region of A' bins in two frequency- 
amplitude spectra is 

P=l-(l-p)", (1) 

where again we recall that p is the joint probability for finding a 
coincident feature in a single bin. A low value of P indicates that a 
feature is statistically prominent compared to the background noise 
and, as such, is worth consideration as a 'mode candidate'. It is 
important to remember that this is fundamentally different from 
determining the probability that a detection is a mode. 

When determining the significance of any observed prominent 
features the value of N in equation (TJ must be fixed. As N in- 
creases, the probability of detecting a prominent feature by chance 
in a single set of A' bins increases and therefore the size of N must 
be suitably capped. However, as N decreases the total number of 
strips containing N bins in the frequency range of the frequency- 
amplitude spectrum that is searched increases. Consequently, the 
probability that at least one of these A' bin ranges contains a false 
detection also increases and so should not be too small. When 
deciding on the optimum value of A' a balance between these two 
requirements must be sought. In the results that follow A' was taken 
to be the number of bins in 100 pHz. 

Various statistical tests are described in lChaplin et al.l ( |2002|) that 
determine the probability of observing prominent structures in a 
single frequency-power spectrum. These tests can be easily adapted 



so they can be used to compare two frequency-amplitude spectra if 
the probability of observing a single prominent spike at the same 
frequency in each of the frequency- amplitude spectra can be calcu- 
lated (Broomhall et al. 2007i). These tests were originally designed 
to detect p modes, however, the underlying theory is still applicable 
when searching for g modes. An outline of the statistical tests that 
were applied here will now be given. 

We define a spike as a single bin with a prominent amplitude 
in a frequency-amplitude spectrum. A 'pair of spikes' is then a 
prominent spike that occurs in the same frequency bin in each of 
two frequency-amplitude spectra. The simplest test that was used to 
compare the BiSON, GOLF and MDI data searched for individual 
prominent pairs of spikes. This test makes no a priori assumptions 
regarding the properties of the modes. Additional tests have also 
been formulated which do take advantage of known mode proper- 
ties that are observed in frequency-power/amplitude spectra. 

One such test searches for a cluster of prominent spikes. For 
example, we have searched for three coincident prominent spikes 
over a range of six bins. This test is designed to take advantage of 
the fact that solar oscillations are damped and the power of a mode 
may be spread across more than one frequency bin. The observed 
width of a mode is dependent on the mode's lifetime and the length 
of the respective data set. Although the lifetimes of low-frequency 
p modes are uncertain, evidence from higher-frequency p modes 
implies that in a time series of length 8.5 yrs, which is the length 
of the data series analysed h ere, even very low-freq uency p modes 
could have resolved widths jBroomhall et al.ll2008l) . The lifetimes 
of g modes are less predictable and so it is uncertain whether or not 
the width of g modes will be resolved. Even if a mode's lifetime 
is greater than the length of the time series it is still possible for 
its power to be spread across more than one bin for the following 
reason. If the mode signal is not commensurate with the window 
function of the observations, its maximum amplitude will be di- 
minished and the majority of its amplitude will be divided between 
two consecutive frequency bins. With this in mind, a statistical test 
was also derived that searched for two consecutive prominent pairs 
of spikes. 

Another statistical test that was used to search the BiSON, GOLF 
and MDI data looked for prominent structures whose patterns 
mimic those expected from the rotational splitting. In Sun-as-a-star 
data only mode components where I + m is even are visible and 
so the statistical test searched for components whose azimuthal 
orders, ra, were separated by 2. This test assumed a separation 
in frequency between adjacent m components of 0.4 ± 0.1 /jHz. 
The error of ±0.1yuHz is included to allow for the influence of 
magnetic fie l ds, wh ich shift the observed frequencies of the modes. 
iGarcia et al.l ( |2004) found that the splittings of p modes can vary 
between 0.38 and 0.45 yuHz, both of which are within the range of 
splittings allowed here i.e. 0.4 ± 0. 1 yuHz. 

We use a split ting of 0.4 ± 0.1 //Hz to maintain consisten cy with 
previous works jchaplin et al I2OO2I : IBroomhall et"ai]|2007l) . How- 
ever, it should be noted that since the rotation in the core is not 
well known this value may not repr e sent t he synodic splitting of 
g modes. For example, iGarcia et al.l ( |2007|) suggest that the core 
may be rotating between 3 and 5 times faster than the radiative 
zone. We note here that one could question the how appropriate it 
is to use this test with a frequentist approach since we are using a 
priori knowledge. Therefore, although the results of the multiplet 
test are inclu ded here for completeness and consistency with pre- 
vious works jChaplin et al .I2OO2I : IBroomhall et al.l200% we regard 
the results with suspicion. 

More detailed descriptions of the statistical tests can be found in 
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IChaplin et al. I( l2002h and lBroomhall et aL We now describe 

how Bayesian statistics can be used to assess the significance of any 
mode candidates. 



USING BAYESIAN STATISTICS TO SEARCH FOR 
SOLAR OSCILLATIONS 



lApDourchauxl ( l2008h describes how Bayesian statistics can be 
used to determine the fr equencies of modes in solar data and 
lApDourchaux et aT I ( l2009h uses Bayesian statistics to search aster- 
oseismic data observed by the COnvection Rotation and planetary 
Transits (CoRoT) spacecraft for e vidence of oscillations. We now 
summarize the methods used by lAppourchaux et al] ( l2009h and 
adapt them so that they can be used to search contemporaneous 
solar data. 

Bayes theorem can be used to determine the 'posterior probabil- 
it y', p(HO|j), that the HO hyp othesis is true given the observed data, 
X. lAppourchaux et alj l l2009f) shows that the posterior probability is 
given by 



pim\x) 



I + 



(1 - Po) p{x\m) 



po p{x\ HO) 



(2) 



where po is the prior probability, which is defined as the subjective 
probability, before any observations have been made, that HO is 
true. The HO hypothesis assumes that the data contains only noise. 
Therefore, in equation Q, p(x|HO) is the probability that a spike 
with a power x is observed in a frequency-power spectrum that 
contains only noise. Similarly, p(x|Hl) is the probability that a 
spike with a power, x, is observed if the Hi hypothesis is true. Here, 
the HI hypothesis assumes that there is a signal in the data with a 
maximum power spectral density per unit bin, H. We are assuming 
that HO and HI cover all possibilities. In other words ther e is no 
third hypothesis, H2 say, that is true. lAppourchaux et ah I 1 I2OO9I) 
derives the expressions for p(x|HO) and p(.t|Hl) that can be used to 
search frequency-power spectra. We now quote these expressions 
for reference. 

We define a spike as the power spectral density contained in 
a single bin of the frequency-power spectrum. Frequency-power 
spectra have a x^, 2 d.o.f. distribution and so the probability of 
observing a spike with a power spectral density of x if the data 
contain noise only is given by 



p(x|HO) : 



(3) 



where we have taken the mean power spectral density of the spec- 
trum to be 1. For the alternative Hi hypothesis we assume that the 
spike contains the signal from a mode which has been stochastically 
excited, as is the case for solar oscillations. However, as we do not 
know the power of the mode we assume, a priori, that the maximum 
power spectral density per unit bin of the mode in a frequency- 
power spectrum is distributed uniformly between and H. The 
probability of observing a spike with a power spectral density, x, 
is then given by 



p(x|Hl) = 1 r -^e-/<'-"'' dH'. 



(4) 



Equations ^ and ^ can be substituted into equation lO to give 
the posterior probability. 
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Figure 1. The variation of the posterior probability with the maximum 
power, H, for different threshold powers, x. 



4.1 Range of heights 

In order to calculate p(jc|Hl) we have to make assumptions about 
the maximum power spectral density per unit bin of a mode profile, 
H, with a frequency, v. Fig. [T] shows the variation with H of the 
posterior probability, which is related to p(x|HI) by equation (O, 
for power spectral densities, x. The choice of H is most influential 
at low H and high x and so it is important to choose the value of H 
carefully to enable the significance of any prominent feature to be 
accurately assessed. As we are searching for oscillations that have 
not yet been observed we need to use models to estimate the value 
of H. 

Notice that for each x plotted in Fig. [T] a minimum posterior 
probability is observed. Such minima can be understood as a man- 
ifestation of the fact that the prior on H being large is contra- 
dicted by the low power s pectral density level reached by the peak 
( lAppourchaux et alj2009l) . In other words, if the mode power spec- 
tral densities were truly so high the observed peak would also be 
prominent. 

Before we discuss the values used to constrain H it is important 
to define exactly what we mean by the maximum power spectral 
density per unit bin of a mode. In this paper we define the maximum 
power spectral density as the peak height of a mode's profile in a 
frequency-power spectrum. Solar oscillations are damped and, if 
the lifetime of a mode is significantly shorter than the length of the 
observations, the mode will appear in a frequency-power spectrum 
in the form of a Lorentzian. The width of the Lorentzian, A, is 
related to a mode's lifetime, t, by the following equation: 



(5) 



The maximum power spectral density per unit bin (or heigh t) can 
be described by the following equation jFletcher et al]|200d) : 



H = 



2V^ 



kTA + 2 



(6) 



where is the total power of the mode, T is the length of the time 
series and A is in units of frequency. Consequently, TA gives the 
width of the Lorentzian in units of bins. 

For p modes the value of H was esti mated in two differen t 
ways, each of which will now be described. lBroomhall et al. 
extrapolated to low frequencies the total mode powers, V^, and 
widths. A, of well-defined, higher-frequency modes, whose pro- 
files in a frequency-power spectrum can be fitted accurately. The 
total power, V', was extrapolated in preference to H as it is not 
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dependent on A. lBroomhall et all did this in two different ways and 
tiere we have used the method that produced the largest H values. 

Even so, it is possible that this method under-esti mates the power 
of the signal from a mode. For example, Broomh all et al.l ( 1200 Sh 
found that the / = 0, « = 6 mode at ~ 972yuHz is excited to 
a larger power than is expec ted from the ext rapolations. In fact, 
the simulations performed bv lBroomhall et aO indicated that, if the 
actual power was equal to the value given by the extrapolation, 
the probability that the / = 0, ;i = 6 mode would be excited to 
the observed power is approximately 0.01. Furthermore, because 
solar oscillations are excited stochastically, the total power of a 
mode, V^, varies with time. If the value of the power given by the 
extrapolation is regarded as a mean, the observed power of a mode 
is equally likely to be greater than this value as it is to be lower. 
Consequently, by integrating over the range to // we could be 
underestimating the true height of the mode. More realistic prob- 
abilities may be obtained if we overestimate the range of heights 
over which the integration in equation l|4j is made. Therefore, we 
have also performed a search where the maximum power spectral 
density per unit bin, H, was assumed t o be constant at a l l frequ en- 
cies and to have the height predicted bv lBroomhall et all ( 1200 8l) for 
a mode with a frequency of 1500yuHz. A larger value of H should 
allow signals to be detected more easily as when H tends to 0, 
p(x|Hl) tends to p(x|HO) and so it is very difficult to determine 
which hypothesis is most likely to be correct (see Fig. [T). 

As for low-frequency p modes we are reliant on models to predict 
the powers of g modes. Two groups have produced predictions of 
g-mode powers. The first group uses the Cambridge stochast ic ex- 
citation model, as applicable to g mo des dHoudek et al.lll999l) . The 
second group jBelkacem eTai]|2009h uses a similar model with a 
diff^ere nt temporal-corre lation between the modes and the turbulent 
eddies: iBelkacem et al.l take the eddy- time correlation function to 
be a Lorentzian, while iHoudek et alj assume that the eddy-time 
correlation is a Gaussian. The authors of both papers feel that 
their respective models are only applicable to limited ranges in 
fr equency and the se ranges do not coincide. However, the work 
of IBelkacem et al.| p redicts signific antly larger g-mode powers than 
those predicted by iHoudek et al.l Therefore, we have taken the 
up per limit for H to be th e maximum power predicted by the work 
of (Belkacem et alj ( |2009| ) as the larger the mode powers the more 
likely we are to be able to detect any oscillations. We have taken 
the total power, V^, to be the constant value of 9mm^ s"^ at all 
frequencies in the range (50-350) yuHz. We have then assumed that 
the lifetime of the g modes is significantly longer than the length of 
the observations and so all of the modes power will be confined to a 
single bin. Therefore the height of the mode, H, is equal to the total 
power, i.e. H = 9 mnr s"^ bin" IBelkacem etal.l ( l2009h note 
that, given the uncertainties in their model, this maximum power 
could, potentially, be a factor of four larger. Therefore, we have also 
determined the posterior probability given the observed prominent 
features when the maximum height, H, was 36 mm' s"^ bin"' . 



4.2 Searching a range of frequencies 

The statistics described above can be used to search individual bins 
of a frequency-power spectrum. However, there are ~ 400, 000 bins 
in the total range of frequencies searched here (50 - 1500yuHz). 
Consequently the chances of making a false detection are 
significant when searching each bin individually. We therefore 
consider a ra nge of fre q uencie s containing M bins. In the manner 
described by ISturrockl ( 1200 8l) we define the following null and 
alternative hypotheses: 



HOO: The principal peak in M bins is due to noise, 

Hll: The principal peak is due to an oscillatory signal with a 

frequency, v. 



Then 

P(x\mO) = 1 - [1 - P(x\m)f ^ MP(x|HO), 



(7) 



when P(x|HO) is small. We assume only one frequency bin out of 
the M of interest contains a signal and so 



P(x\mi) = P{x\m). 



(8) 



Therefore, 



p(HOO|x) 



I + 



I + 



(1-po) p(-iiHll) 



po p{x\mo) 
(1-Po) p(x\m) 



Po Mp(x|HO) 



(9) 



To ensure that only one mode component is present in the M bins 
we have taken M to be the number of bins in 0.4yuHz = 106, 
as this is the expected splitting between adjacent m components. 
Note that we have erred on the side of caution when setting this 
value of M as in Sun-as-a-star data only (/ + /n)-even components 
can be detected. Therefore, the average splitting between adjacent 
visible components is closer to 0.8 yuHz. We have assumed that the 
components of g modes are also rotationally split by OApKz. It 
should be noted that the core rotation rate is v ery uncertain and so 
this assumption may not be valid. For example. iGarcfa et al. I( l2007l) 
suggest that the core might be rotating between three and five times 
faster than the radiative zone. However, even if this is the case there 
will still be only one component present in the M = 106 bins under 
consideration. One possible extension to this work would be to test 
the data taking into account a faster core rotation rate. 



4.3 Searching smoothed spectra using Bayesian techniques 

The lifetimes of p modes are expected to be short enough for their 
profiles to have a resolved width in the frequency-power spectrum 
examined here. Until now we have assumed that g modes have 
very lo ng lifetimes but this is not necessarily correct. lOarcfa et al.l 
1 I2OO7I) have found a very interesting peak in a periodogram of a 
periodogram of GOLF data. They claim that this peak is due to the 
superposition of g modes that lie in the asymptotic range where 
the separation in period is approximately constant (~ 24min). The 
peak is positioned cl ose to the period predicted by standard solar 
models. iGarcfa et al] use artificial data to show that, if this peak is 
evidence of g modes, the modes might have widths in a frequency- 
power spectrum that are commensurate with damping times of 
several months. However, this is not the only explanation for the 
observed mode widths. The observed results could also be due to 
an internal magnetic splitting, changes with time of the cavities in 
which the oscillations were trapped or a noise effect in the GOLF 
data. If the damping times are of the order of months and given 
that the data searched here are 3071 d in length, the width of these 
m odes will be resolved in fre quency-power spectra. 

lAppourchaux et al. I ( I2OO9I) describe how Bayesian statistics can 
be used to search for modes with resolved widths. This is done 
by summing the frequency-power spectrum over a given number 
of bins. Smoothing a spectrum alters the underlying statistics of 
the noise and so it is necessary to derive new equations for p(x|HO) 
and p(x|Hl). If we smooth a frequency-power spectrum over R bins 
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the underlying statistics of tlie noise becomes with 2R d.o.f.. 
We have used the equation s for p(x|HO) and p ( .t:|Hl) , which are 
described in Section 3.2 of lA ppourchaux et al ] ( l2009h . to search 
solar data for modes with lifetimes that are shorter than the length 
of the time series. We now give the equations for reference. 

The probability that, in a spectrum that has been smoothed over 
R bins, a spike with a power spectral density of x is observed if the 
HO hypothesis is true is given by 



p{x\m) = 



(10) 



where T(R) is the gamma function and we have assumed that the 
mean of the unsmoothed frequency-power spectrum is unity. For 
the alternative hypothesis we again assume that the spike contains 
the signal from a stochastically excited mode. However, when deal- 
ing with smoothed spectra, instead of making assumptions about H 
we must make assumptions about the total mode power, V^, and 
the mode linewidth, A. Here we assumed, a priori, that the mode 
amplitude is distributed uniformly between and V and that the 
mode linewidth is distributed uniformly between and A. Then the 
probability of observing a spike with a power spectral density x in 
a smoothed spectrum, if the HI hypothesis is true, is given by 



p(x|Hl) : 



VAJo Jo 



A" 

tTvY 



(11) 



where A and v are given in I Appourchauxl j2004h and are related 
to the line profile of the mode, i.e. they are related to V and A. 
Equations jlOt and dl il l can be substituted into equation ^ to give 
the posterior probability. 

When searchin g for p modes, we have used the values for V- and 
A extrapolated bv lBroomhall et al. liooi) as the upper limits on the 
mode powers and lifetimes. When searching for g modes we took 
to be both 9 mm^ s^ and 36 mm- s^ (see Section|4T|for details). 
As the lifetimes of the g modes are not known we used a range of 
different values: 1, 2, 3, and 4 months. Equation ([9} was then used 
to determine the posterior probability. The posterior probability, 
given by Equation ((g}, is also dependent on the prior probability, 
po, and we now discuss the value given to po. 



5 WHAT VALUE SHOULD THE PRIOR PROBABILITY 
TAKE? 

One important question when applying Bayesian statistics is what 
value should the prior probability, po, take? We define po as the 
subjective probability that the HO hypothesis is correct, as assigned 
before the data are examined. If po is set at a very high (or very low) 
value any statistical tests are likely to just confirm the initial belief 
that HO (or HI) is correct. Therefore, it is common to set po = 0.5 
so as to avoid prejudicing one hypothesis over the other. However, 
we know that low-frequency p modes have narrow widths, which 
cover at most a few bins only. In which case, would we expect the 
probability that HI and HO are true to be the same at all frequen- 
cies? Is it possible to use our knowledge of the solar structure to tell 
us which hypothesis is more likely to be true at a given frequency? 
Can we also use this knowledge to then systematically guide the 
regions in frequency we wish to search? 
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Figure 2. An echelle diagram showing the frequencies predicted by differ- 
ent solar models. At the centre of each row, the black symbols represent the 
frequencies of the 1 = modes, the red symbols represent the 1=1 modes, 
the green symbols represent the / = 2 modes and the blue symbols represent 
the 1 = 3 modes. All components with l+m even have been plotted assuming 
a symmetric synodic splitting of 0.4 /iHz. The purple curve shows the fre- 
quency distribution of the models. Towards the top of each row the m = 
comp onents of the Saclay seismic m odel frequencies jXurck-Chieze et alj 
I2OOIL blue crosses), the Ml model (Zaatri et al. '2007? orange triangles) 
and Model S frequencies (Chi'istensen-Dalsgaard & Berthomieu. 1991.. pink 
plus signs) are shown for comparison purposes. 



5.1 A non-uniform prior that can be used to search for 
p modes 

We have looked at the p-mode frequencies pred i cted by 5000 
solar models (see iBahcall. Serenelli & Basul I2OO6I : Ichaplin et al.l 
I2OO7I for details), whose input parameters, such as composition 
and age, have been altered within their error bars. The frequen- 
cies of the modes predicted by the models have been plotted on 
an echelle diagram in Fig. |2] Different colours have been used 
to represent the different / and the predicted frequencies of all of 
the visible components (when I + m is even) are plotted. We have 
taken the synodic splitting to be 0.4 ^Hz and symmetric, which is 
a reasonable a ssumption for the fr equency range of interest here 
(< 1500 yuHz, IChaphn et al.]|2002l) . Although the rotation rate of 
the solar core is uncertain this assumption remains valid as the 
p mode rotational splitting is relatively insensiti ve to the rota- 
tion speed of the core (see e.g. lOarcfa et al.ll2008i) . For reference 
the centroid frequencies predicted by the Saclay seismic m odel 
jTurck-Chieze et alj|200ll). the Ml mo del feaat ri et alj|2007l) . and 
Model S jChristensen-Dalsgaard & B erthomieu 1199 ih have been 
included. The purple curve in Fig. [2] shows the shape of the distri- 
bution of the model frequencies. 

The frequencies predicted by the models for each individual 
mode are spread over a reasonably large range, indicating the 
uncertainties associated with solar modelling. However, there are 
definite gaps in frequency where none of the models predict a mode 
should be detected. Solar models would have to be changed beyond 
reasonable expectations to account for any mode frequencies that 
lie outside the predicted ranges. In the frequency ranges where no 
modes are expected, we can set the prior probability that HO is 
true to unity thereby making it impossible for a prominent feature, 
which is likely to be noise, to pass any statistical tests. We have, 
therefore, defined an inverted top-hat prior that is equal to 0.5 in 
the frequency ranges where, according to the models, the modes 
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Figure 3. Top panel: An echelle diagram showing tlie variation of the prior 
probability with frequency for p modes. Bottom panel: An echelle diagram 
showing the variation of the of the prior probability for g modes. In each 
panel the prior probability was either 1.0, at frequencies where modes 
ai'e not expected to be observed, or 0.5 at frequencies where modes are 
expected. 



should be observed and po = 1 at all other frequencies. Such a prior 
is shown in the top panel of Fig. [S] This approach should severely 
limit the number of false detections that are made. 



5.2 A non-uniform prior that can be used to searcli for 
g modes 

The sensitivity of g- mode frequency prediction s to solar models 
is of the order of 1% jAppourchaux et al]|2010l) . This means, for 
example, that at 100/jHz the expected error on the predicted mode 
frequencies is 1 yuHz. We have used this error estimate to produce 
two inverted top-hat prior distributions for g modes. For the first 
prior the centres of each hat are determined by the mode frequen- 
cies predicted by the Ml model feaatri et al.ll2007h and the width 
of each hat was 2 per cent of the mode frequency i.e. ±1 per cent. 
For the second prior, the width of each hat was still 2 per cent of the 
mode frequency, however, we have used the f requencies predicted 
by the Saclay solar model jMathur et all 20071) . Frequencies where 
modes are expected to be observed have been given a prior proba- 
bility that the HO hypothesis is true of 0.5 and elsewhere the prior 
is unity. The bottom panel of Fig. [3] shows the Ml model prior dis- 
tribution, where all visible components in Sun-as-a-star data (when 
I + in is even) have been plotted for Z = 1 and 1 = 2 modes. We have 
not plotted the Saclay model prior here as it is very similar to the 
Ml model prior. We have assumed that the different m components 
ar e still separated by .4 /jHz. As previously mentioned, the results 
of lGarcia et al. U2OO7I) imply that this assumption may not be valid 
for g modes. In the absence of a definitive estimate for g-mode 
splittings, we have chosen to maintain consistency with the splitting 
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Figure 4. The coherency observed between BiSON and GOLF data when 
the GOLF data starts 24 h after the BiSON data. 



value used when searching for p modes. However, as new infor- 
mation on the rotational splitting of g modes becomes available 
an updated uneven prior probability should be constructed. Notice 
that despite the large number of g modes that are present at low 
frequencies the prior is not uniform with frequency and so, again, 
this approach should limit the number of type I false detections. It 
is possible that an improved g-mode uneven prior probability could 
be constructed by producing models for g modes that are similar to 
those used to construct the p-mode uneven prior probability. 



6 SEARCHING MORE THAN ONE DATA SET USING 
BAYESIAN STATISTICS 

As mentioned in Section [3] iBroomhall et al.l ( |2007|) developed the 
statistics necessary to allow two sets of contemporaneous data to 
be compared using a frequentist approach. This has been done both 
to lower the amplitude/power threshold levels and to improve the 
reliability of any detection because if a prominent feature is found 
in data observed by two different instruments it is more likely to 
be solar in origin (although it should be noted that there may be 
some consistent spacecraft signatures in the GOLF and MDl data). 
However, it is worth mentioning that by comparing the data we 
are limited by the signal-to-noise ratios of the worst instrument. 
For example, there is a known instability problem with the MDl 
instrument that causes low-frequency noise. Consequently, some 
mode candidates that are only prominent in one data set have not 
been highlighted here. This could increase the number of type 11 
errors but, conversely, will limit the number of type 1 errors, which 
we consider to be more serious than type 11 errors. In this paper, we 
aim to use Bayesian statistics to compare multiple data sets. Since 
contemporaneous BiSON, GOLF and MDl data will contain some 
common noise (see Fig.|4j, the probability that a prominent spike, 
which is found in the same frequency bin in each frequency-power 
spectrum, is due to noise is non-trivial to derive, whether using a 
frequentist or Bayesian approach. We offer a solution that avoids 
the need to consider the effect on the statistics of the noise shared 

by each data set. 

Fig.|4]shows the coherency telsworth et alj|l994h of the BiSON 
and GOLF data when the GOLF time series begins 24 h after the 
BiSON data. The separation in start times means that the coherency 
of the noise observed between the data is reduced to zero below 
1500yuHz and the same is true for the other combinations of the 
BiSON, GOLF and MDl data. Therefore, separating the start times 
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of the data sets removes the comphcation caused by the common 
noise. When no common noise is shared by the two data sets under 
consideration the posterior probability, p(HOO|x), that, if a feature 
appears in both frequency-power spectra, HOO is correct, is given 
by 



p(HOO|x) 



1 + 



(1-po) p(^i|Hl)p(x2|Hl) 



po Mp(.i-i|H0)p(x2|H0) 



(12) 



where x\ is the power spectral density of a spike in the first 
frequency-power spectrum and X2 is the power spectral density of a 
spike at exactly the same frequency in the second frequency-power 
spectrum. When comparing two sets of non-contemporaneous data 
for coincident prominent features we are assuming that the mode 
signatures will still be coherent if we shift the start times and we 
now show that this assumption is valid at low frequencies. 

6.1 The coherency of modes in shifted spectra 

Simulations were performed to investigate how a modes's co- 
herency changes in non-contemporaneous data. The modes were 
simulated by c reating a damped os cillator that was re-excited ran- 
domly in time jChaplin et al.lll997h . The power and damping time 
of each mode were determined by fitting the Bi SON data us- 
ing a standard likelihood maximization method (e.g. lChaplin et al.l 
1 19991) . Simulations were performed for / = 1 modes with 10 < n < 
22 and a separate simulation was performed for each mode. These 
modes are at higher frequencies than the ones we are searching for 
in this paper. However, they have a large S/N ratio and so their 
properties, such as lifetime and power, can be determined more 
accurately. The simulations were performed using two identical 
time series that contained the signal from one simul ated mode only. 

Th e top panel of Fig. [5] shows the coherency ( lElsworth et al.l 
of the modes when the start times of the simulated time 
series were shifted so that they were separated by (5/ = h (black 
plus signs), 6t = 12 h (black crosses) and 6t = 24 h (black dia- 
monds). The coherency of two unshifted identical sets of data is 
always unity, independent of the power and width of the simu- 
lated mode. The top panel of Fig. |5] shows that the effect on the 
coherency of a mode of separating the start times of two data sets 
is frequency dependent. More specifically (bottom panel of Fig.[5j 
the magnitude of the decrease in coherency is lifetime dependent. 
The coherency of a low-frequency mode, with a long lifetime, is 
reduced by less than the coherency of a higher-frequency mode, 
with a short lifetime. This is because, as the time shift represents 
a smaller proportion of a low-frequency modes' lifetime, the mode 
experiences less damping over the duration of the shift. The mag- 
nitude of the decrease in coherency increases as the length of the 
shift in start times increases relative to the mode lifetime. However, 
is the decrease in coherency observed by modes below 1500yuHz 
significant when a time shift of 24 h is introduced? 

Modes with a frequency of about 1500pHz have a lifetime of 
approximately 1 month and so 24 h is equivalent to 1 /30th of the 
mode's lifetime. Therefore, for each of the modes plotted in Fig. [5] 
the start times of the two simulated time series were separated by 
one-thirtieth of the mode's lifetime. The triangle symbols in the top 
panel of Fig. |5] shows that this only decreases the coherency by a 
very small amount at all frequencies. Fig.|6]plots the variation in the 
coherency with St/r for three / = 1 modes. The values of coherency 
plotted in Fig.[6]were determined using the same simulated spectra 
as were used to produce Fig. |5] Fig. |6] shows that if 6t/T is small, 
the coherency decreases by only a small amount. Notice that size of 
the decrease in coherency shows some dependence on n. However, 
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Figure 6. The variation in the coherency of three modes with St/r. The 
results have been plotted for the / = 1, n = 10 mode, whose frequency 
is approximately 1612 /iHz, the / = 1, n = 16 mode, whose frequency is 
approximately 2425 /jHz and the / = 1 , n = 22 mode, whose frequency is 
approximately 3233 /iHz. 
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this dependence is small and is likely to occur because the power 
of the modes decreases with n. We conclude that, if the start times 
of two time series are separated by 24 h the low-frequency modes 
will still be coherent and so the data can be compared to search for 
low-frequency oscillations. 

Note that it is possible to separate the start times of the time 
series by less than 24 h and still significantly reduce the amount 
of common noise shared by the data. However, we are dealing 
with long data sets and so 24 h is only a small fraction of the total 
observation time. Therefore a shift of 24 h was used to ensure that 
there was as little common noise present as possible. 

A table summarizing the main assumptions made with both the 
frequentist and Bayesian approaches can be found in Appendix lAl 
We now describe the results of using both the frequentist approach 
and Bayesian statistics to search BiSON, GOLF and MDI data. 



7 RESULTS OF SEARCHING PAIRS OF DATA SETS FOR 
LOW-FREQUENCY P MODES AND G MODES 

7.1 Observational data 

In this paper we made use of contemporaneous BiSON, GOLF and 
MDI time series. Each data set consisted of 3071 d of Sun-as-a- 
star Doppler velocity observations, made between 1996 April 20 
and 2004 September 15, and so the data span most of solar activity 
cycle 23. We have used data sets that extend only a s far as 2004 

( l2007l). The BiSON 



Appourchaux et all 



to maintain consistency with lBroomhall et"al] 
data w ere p rocessed in the mann er described by 
( l2000h and lChaplin et alj ilOO'j). The GOLF data were processed 
in the manner described bv loarcia et al.l | |2005|) . and the MDI- 
Luminosity Oscillations Im ager (LOI) proxy data were processed 
in the manner described by IScherrer et a i] jl995l) . When consid- 
ered individually each time series is stored on a different cadence: 
BiSON data are stored on a 40 s cadence, GOLF data are stored 
on a 20s cadence and MDI data are stored on a 60s cadence. 
Each time series was, therefore, re-binned to a common cadence 
of 120 s. Hence the time series contained 221 1 120 samples, giving 
a bin width in the frequency domain of 0.004 yuHz. It is possible 
that some aliasing could be present in the data due to the cut- 
off frequency of 4167yuHz. However, no aliasing was found in the 
frequency range searched here i.e. between 50 and 1500/jHz. The 
duty cycle of the BiSON time series was 78.6 per cent; the duty 
cycle of the GOLF time series was 93.4 per cent and the duty cycle 
of the MDI time series was 91.3 per cent. 



7.2 Using a frequentist approach 

The frequentis t stat istics described in Section [3] and 
iBroomhall et alj | |2007|) were used to search contemporaneous 
BiSON, GOLF and MDI frequency-amplitude spectra for low- 
frequency p modes, g modes and mixed modes. Amplitude 
threshold levels were calculated for a 1 per cent probability 
of detecting a prominent feature by chance in 100 /jHz. The 
spectra were searched for prominent spikes or patterns of spikes 
found in the same frequency bin or bins in any two of the 
frequency-amplitude spectra. 

If the mode signal is not commensurate with the window function 
of the observations the maximum observed amplitud e of the mode 
can b e diminished. The 'bin-shifting' strategy of I Chaplin et al.l 
( I2OO2I) was therefore applied in an attempt to circumvent this prob- 
lem. 
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Figure 7. An echelle plot, modulo 145 /iHz, marking locations in frequency 
of occurrences uncovered by the test searches. Locations in frequency 
where spikes, or patterns of spikes, were found in the same bin, or bins, in 
BiSON and GOLF frequency-amplitude spectra at levels sufficient to record 
P <\ per cent are marked by the black symbols in the middle of each row. 
A different symbol has been used for each test (see figure legend). We have 
also recorded prominent spikes or patterns of spikes found by comparing 
either the BiSON and MDI frequency-amplitude spectra (red symbols at 
top of each row) or the GOLF and MDI frequency-amplitude spectra (blue 
symbols at bottom of each row). Symbols surrounded by a green square 
represent the prominent occun'ences listed in Table [T] The orange vertical 
lines mark locations of the frequencies of p modes with / = — 3 predicted 
by the Saclay seismic model (Turck-Chieze et al. 2001). The vertical purple 
lines mark locations of the central frequencies of g modes with / = 1 - 2 
predicted by the Ml model iZaatri et al..200^) . 



Fig. |7] shows visually, in the form of an echelle diagram, the 
locations in frequency where prominent spikes or patterns of spikes 
were observed. The entire range that was searched (50 - 1500 yuHz) 
has been split into strips of 145 yuHz. These strips have then been 
placed one above another. The bottom left-hand corner of the plot 
represents 50yuHz and the top right-hand corner of the plot repre- 
sents 1500 yuHz. A repeat frequency of 145 /jHz was chosen in pref- 
erence to the lOOyuHz slice on which the searches were performed 
because at low frequencies consecutive overtones of the low-/ 
modes are separated in frequency by approximately this amount. 
At higher frequencies the spacing between modes with the same / 
and adjacent n\i ~ 135 ^Hz, however, this separation increases at 
low frequencies (low n). The choice of repeat frequency means that 
the majority of the predicted p-mode frequencies are arranged in 
four near- vertical strips, one for each / between and 3. 

Artifacts, positioned in frequency at overtones of the 11.57yuHz 
diurnal frequency, may be visible in frequency-amplitude spectra 
because of signatures of either the window function (for the BiSON 
data) or the spacecraft operation (for the GOLF and MDI data). To 
prevent any of these artefacts being confused with mode candidates, 
any detections found to be closer than 0.3 yuHz to a daily harmonic 
frequency were discounted. A distance of 0.3 /jHz is sufficient to 
remove any false detections due to the daily harmonics as their 
frequencies are well defined and their peaks have a width of less 
than 0.3 yuHz. The MDI data contain harmonics of 52.125 yuHz due 
to beats between the spacecraft timing system and the instrument 
sampling rate. Any detections found when the MDI data were 
searched that lay within 0.3 /jHz of these harmonic frequencies 
were also discounted. 

Detections were only considered as possible mode candidates if 
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Table 1. Candidates found to be closer than 1//Hz to the predicted frequencies of p modes from the Saclay seismic model and the predicted frequencies of 
g modes from the Ml model. P is the probability that, assuming the data contains only noise, a coincident prominent feature is observed at least once in 
100 /jHz. The last column of this table indicates whether the modes were also detected using a Bayesian approach (see Sections|T3]and|T4). 
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9 


+ 1 


1473.269 ± 0.009 


3.7 X lO-** 


4 




y 






mean 


1472.851 ±0.006 






0.122 





"Difference with model frequency assumes m = ± 1 components lie ±0.4 /jHz from central frequency. 
'Difference with model frequency assumes m = -2 component lies -0.8 fiHz from the central frequency. 



they were positioned within 1 yuHz of a predicted mode frequency. 
A maximum separation between a detection and a model frequency 
of 1 /jHz was allowed to account for any uncertainties in the model 
frequencies, any shifts due to the effects of magnetic fields, and 
because the power of some modes is spread across more than one 
frequency bin. It should be noted that Figs[2]and[3] which show the 
frequencies predicted by different solar models and the uneven pri- 
ors used in the Bayesian approach, suggest that the allowed differ- 
ence with the model should be larger than ± 1 /jHz. However, it was 
felt that an increase in the allowed difference between the model 
and the observed frequencies would significantly increase the num- 
ber of type I false detections that were considered as mode candi- 
dates. Any detections that lie within 1 /jHz of a model frequency 
are surrounded by a green square in Fig. |7] and are listed in Table 
[T] We have compared the observed mode candida t es with both 
the S aclay seismic model jTurck-Chieze et a i]|200ll: iMathur et al.l 
l2007h and the Ml model frequencies jZaatri et alj2007[) . However, 
we have not plotted all of these frequencies in Fig.|7]as both models 
predict similar frequencies. In cases where the outer components of 
a rotationally split mode appear to have been uncovered, the mean 
of the component frequencies has been determined, in order to give 
an estimate of the centroid frequency of the mode. Estimates of the 
un certainties in frequen cy were calculated in the manner described 
bv lChaDlinetaLl ( l2002l) . 

In the p-mode frequency range all of the listed c andidates cor- 
respond to previously claimed detections (see e.g. [Toutain et al.l 
I998l:lBertello et al.ll200(]|: lOarcfa et alJl200lUChaplin et al.ll2002l : 



Broomhall et alJl2007l;ISaIabert et al.ll2009h . One of the detections 



in the g-mode range lies close to the predicted frequencies. To 
improve confidence in the detections we required that each mode 



candidate passed more than one of the statistical tests, as it has been 
determined that this requirement significantly reduces the number 
of false detections made. As can be seen from Table[T]this discounts 
the g-mode candidate and some of the detections in the p-mode 
range. For comparison purposes the last column of Table[T]indicates 
whether the candidates were also detected when Bayesian statistics 
were used (see Sections fT.SI and IT!?! . 

Several multiplet detections lie close to the predicted mode fre- 
quencies but are not highlighted by a green square in Fig.|7] This 
is because not all of the detected components could be associated 
with the predicted frequencies of the components of one mode. 
For example, a triplet was detected in the BiSON and MDI data 
at ~ 1039 /jHz. However, the mode closest to the detection is the 
I = I, n = 6 mode, which only has two visible components 
(m = ±1). Since only two of the three detected spikes could be 
associated with model mode frequencies this was not counted as a 
mode candidate. Similarly the multiplet candidate at ~ 537 yuHz is 
closest to the I = 0, n = 3 mode, which cannot be observed as a 
multiplet (as 1 = 0). 

In Fig.|7]we can see that coincident multiplet detections are made 
at ~ 249 yuHz in the BiSON and GOLF data and the GOLF and 
MDI data, which do not correspond to a predicted mode frequency. 
One of the detected components is at the same frequency in each 
detection (at 248.474 yuHz). This component is particularly promi- 
nent in the GOLF data. The second component observed in the 
BiSON and GOLF data is not at the same frequency as the second 
component that is observed in the GOLF and MDI data. Therefore, 
these detections are not truly coincident. 

It is apparent from Fig.|7]that the analysis has uncovered several 
occurrences of P < 1 per cent that lie well away from the predicted 
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mode frequencies. The number of these detections exceeds the 
number expected (< 2) given the range in frequencies searched, the 
threshold probability, and the number of statistical tests performed. 
This is most likely to be due to the misleading significance levels 
assigned when adopting a frequentist approach. It could also indi- 
cate that the statistical distribution in the noise is slightly different 
to the Gaussian distribution, which was assumed when deriving the 
statistical tests. It is also worth noting that given the large number 
of g modes that can theoretically be detected, particularly at very 
low frequencies, it is hardly surprising that some of the detections 
lie close to predicted mode frequencies. 

7.2.1 Searching for g mode peaks in contemporaneous data 

As pr eviously mentioned the results obtained by iGarcfa et al.l 
( I2OO7I) could imply that g modes may have lifetimes of the or- 
der of months, not years. If this is true they will have a resolved 
width in a frequency-amplitude spectrum and so we have searched 
the contemporaneous, very low-frequency (v < 350yuHz) BiSON, 
GOLF and MDI data for evidence of prominent peaks with various 
widths. We have searched for clusters containing two, three, four 
and five spikes. We allow a cluster to be spread over twice the width 
of the mode and took the widths of the modes to be the number 
of bins covered if the mode lifetimes were 1, 2, 3 and 4 months, 
respectively. However, no statistically significant peaks were found 
within 4pHz of the model g-mode frequencies. 



7.3 Results of searching the data using Bayesian techniques 

The statistics outlined in Sections [4l6l were used to search BiSON, 
GOLF and MDI data in pairs with a Bayesian approach, respec- 
tively. The start times of each pair of time series were separated 
by 24 h so that only a minimal amount of comm on noise was 
presen t in the data. The 'bin-shifting' strategy of IChaplin et al.l 
( l2002h was again applied. Equation I ll2t was used to determine the 
posterior probability, /7(H00|x), where initially we took the height 
of the p modes, H, to be given by the extrapolation described in 
iBroomhall eTai] ( l2008h (see Section |4Tt . We have assumed that 
g mode lifetimes are sufficiently long for all of their power to be 
contained in one bin and the maximum g-mode height was taken 
to be 9 mm^ s~^bin~' at all frequencies. The frequency range of 
250 - 1500 juHz was searched for solar p modes and the frequency 
range of 50 - 350yuHz was searched for g modes. 

The top panel of Fig. [8] shows the BiSON spectrum over the 
range 1300 pHz to 1500 pHz. The crosses above the spectrum show 
the p-mode fre quencies predicted by the Saclay seismic model 
jTurck-Chieze e t al. 2001). As can be seen, it is difficult to distin- 
guish possible mode candidates from the background noise. The 
middle panel of Fig. [8] shows [1 - p(HOO|ji:)], for observing a 
prominent feature in the BiSON and GOLF data, calculated with a 
uniform prior of po(v) = 0.5. As [l-p(HOO|x)] approaches unity the 
more likely we are to reject the HOO hypothesis that the frequency- 
power spectrum contains noise only. Four mode candidates are 
clearly visible in this plot, at ~ 1330/jHz, ~ 1393 yuHz and two 
at ~ 1472 yuHz. All four candidates lie close to predicted mode 
frequencies. The bottom panel shows the posterior probability over 
the same range of frequencies, only this time the top-hat prior prob- 
ability described in Section[5]was used. All of the mode candidates 
are allowed by the uneven prior 

Fig.|9]is an echelle diagram showing the results of searching the 
BiSON, GOLF and MDI data using a Bayesian approach. Mode 
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Figure 8. Top panel: a section of the BiSON frequency-power spectrum. 
Middle panel: the posterior probability observed when BiSON and GOLF 
data are searched for p modes. The BiSON data was shifted to start 24 h 
after the GOLF data. A uniform prior probability of po = 0.5 was used. 
Bottom panel: The posterior probability observed when BiSON and GOLF 
data were searched for p modes. However, in this panel a the inverted top- 
hat prior described in Section[5]was used. 

candidates have been plotted if p(llOO\x) < O.Ol. Therefore, at the 
plotted frequencies, we reject the HOO hypothesis that the data con- 
tains noise only, in favour of the H 1 1 hypothesis which states that 
there is some signal in the data. The non-uniform prior described 
in Section [5] was applied and resulted in one p-mode candidate 
being discounted. This candidate was detected in the BiSON-MDI 
combination and the GOLF-MDI combination. The frequency of 
the candidate corresponds to a spacecraft frequency that is very 
prominent in the MDI frequency-power spectrum. Therefore, the 
fact that this candidate is discounted is a good example of how ad- 
vantageous the uneven prior can be. Very few detections are made 
in the p-mode frequency range. However, all of the candidates that 
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Figure 9. An echelle plot, modulo 145 /iHz, marking locations in frequency 
of occurrences uncovered by the Bayesian test searches. Realistic estimates 
of the upper limit on the modes powers were used to calculate the Bayesian 
probability, p(HO|,v). The different symbols represent the dilferent pairings 
of data that were searched (see legend). 
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Figure 10. An echelle plot, modulo 145 pHz, marking locations in fre- 
quency of occurrences uncovered by the Bayesian test searches. The maxi- 
mum power, H, was overestimated when calculating p(HO|x). The different 
symbols represent the different pairings of data that were searched (see 
legend). 



are detected correspond to mode frequencies. This implies that the 
Bayesian approach can successfully discriminate between signal 
and noise. No candidates are found in the g-mode range. 

Fig.[TO]is an echelle diagram that again shows the results of using 
Bayesian methods to search for solar oscillations. The only differ- 
ence between Figs [9] and [TO] is that larger maximum heights, H, 
have been used when determining the posterior probability. When 
searching for the p modes we set H to be the height a mode at 
1500 yuHz is expected to have. When searching for g modes the 
maximum height was taken to be 36 mm^ s^'bin"', which is four 



times larger than the heights assumed when producing Fig. [9] (see 
Section 1431 for details). More mode candidates are detected in the 
p-mode range than were found when the lower heights were used 
(see Fig. |9}. Furthermore, the p(HOO|x) is lower when the larger 
values of H were used for all of the candidates that are plotted in 
both Figs |9] and (TO] This could indicate that we were correct to 
overestimate the upper limit of H when determining p(x|Hl 1). 

Fig. [TO] shows that one g-mode candidate was detected. Even 
though this candidate is close to a predicted p-mode frequency 
(/ = 0, n = 1) it has been classified as a g-mode candidate as it was 
detected when the value for H was assumed to be 36 mm" s^^bin"' , 
i.e. H was set using the g-mode models. However, we do not rule 
out the possibility that this detec tion is a p mode. A peak i n this 
frequency region is mentioned bv lTurck-Chieze et alj liooj) , who 
looked at GOLF data only. 

Table[2]lists the mode candidates that are shown in Fig.llOlalong 
with the minimum observed posterior probability, /:)(HOO|a"). The 
value of p(HOO|x) quoted in Table [2] is dependent on which of the 
two sets of data were considered. The / = 1 , n = 9, fn = - 1 mode 
candidate at 1472.433 yuHz was detected at a significant level in all 
three combinations of data (see Fig. [TO}. It was most prominent 
in the BiSON-GOLF combination, when /XHOOU) = 1.6 x 10"'. 
When this candidate was observed in the GOLF-MDI combination 
p(HOO|x) = 2.3x10"^ and when it was observed in the BiSON-MDI 
combination p(HOO|x) = 1.4x lO""*. When a candidate was detected 
in more than one combination of data sets the minimum value of 
/:i(HOO|x) is included in Table[2] In fact all of the included values of 
/:i(HOO|a:) were taken from the BiSON-GOLF combination, except 
for the g mode candidate, which was observed in the GOLF-MDI 
combination. However, unlike the / = 1, « = 9, m = -I candidate, 
some mode candidates were observed to be more prominent in the 
BiSON-MDI combination than in the GOLF-MDI combination. 
For example, the / = \, n = 8, m = -1 mode candidate was 
found to have a posterior probability of 7.4 x 10"** when observed 
in the BiSON-MDI combination and 6.6 x 10"' when observed 
in the GOLF-MDI combination. It should be noted that all of the 
results described in this section were dependent on the assumptions 
made when determining p(HOO\x). Different initial assumptions 
may have led to different results. 



7.4 Results of searching smoothed spectra with Bayesian 
statistics 

As previously mentioned it is likely that the power of the 
p modes is spread o ver more than one frequency bin. Furthermore, 
lGarciaetal1j2007h found evidence that could suggest that g modes 
have short lifetimes, and so g-mode powers could also be spread 
across more than one frequency bin. With this in mind we have used 
Bayesian techniques to search smoothed frequency-power spectra. 

We have used the methods described in Section 14.31 to search 
smoothed frequency-power spectra. We have then used equation 
( 1121 ) to determine the posterior probability. Again the frequency- 
power spectra were searched in pairs with one time series shifted 
to start 24 h after the other. 

Fig. [TT] shows the results of searching the BiSON, GOLF and 
MDI data when the frequency-power spectra were smoothed over 
various numbers of bins (all even values between 2 and 20). The 
uneven prior has been applied and removes 17 false detections, 
some of which can be attributed to daily harmonics and spacecraft 
frequencies. 

Table [3] contains the frequencies of the observed mode candi- 
dates. Notice that the number of bins we needed to smooth over, R, 
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Table 3. Candidates found when using Bayesian statistics to search smoothed spectra. R is the number of bins we have smoothed over. 



; 


n 


m 


Frequency 
(A<Hz) 


Minimum 
Posterior 
probability 
[P(HOO|.v)] 


R at which 
minimum 

obtained 


Distance 
from model 
frequency 





7 





1120.01 ±0.05 


4.4 X 10-^^ 


14 


1.62 


1 


7 


-1 


1182.64 ± 0.02 


5.4 X 10"' 


12 


2.82" 


1 


7 


-1 


1185.17 ±0.02 


1.2 X 10"' 


12 


0.29° 


2 


7 


-2 


1 244 90 + Ofi 


1.3 X 10"' 


16 


5.07* 





8 





1259.08 ± 0.05 


7.2 X 10"' 


14 


4.52 


1 


8 


-1 


1329.17 ±0.08 


6.4 X 10-25 


20 




1 


8 


+ 1 


1329.93 ±0.08 


4.2 X 10"* 


20 








mean 


1329.55 ±0.11 






0.36 


2 


8 


-2 


1393.87 ±0.07 


1.3 X lO""* 


18 


0.34* 





9 





1407.36 ± 0.08 


2.4 X 10"'" 


18 


0.63* 


1 


9 


-1 


1472.43 ± 0.04 


8.2 X 10"'** 


10 




1 


9 


+ 1 


1473.19 ±0.07 


2.5 X 10"'2 


18 








mean 


1472.81 ±0.08 






0.38 



"Difference with model frequency assumes the m = - I component lies ±0.4 /iHz from central frequency. 
'Difference with model frequency assumes m = -2 components lies ±0.8 /iHz from central frequency. 



Table 2. Candidates found using Bayesian statistics. 



/ 


It 


in 


Frequency 

(Mm 


Posteiior 
probability 
[/XHOOU)] 


Distance 
from model 
frequency 
(jjHz) 


2 


-2 


+2 


258.916 ± 0.002 


7.0 X 10"' 


1.236 





6 





972.613 ±0.002 


4.2 X 10"' 


0.132 


1 


7 


-1 


1185.197 ±0.005 


5.1 X lO""* 


0.256" 





8 





1263.209 ±0.007 


6.2 X 10"'' 


0.397 


1 


8 


-1 


1329.235 ± 0.005 


4.5 X 10"" 


0.273" 


2 


8 


-2 


1393.872 ±0.007 


3.6 X 10"' 


0.336* 


1 


9 


-1 


1472.433 ± 0.008 


1.6 X 10"' 




1 


9 


+ 1 
mean 


1473.270 ± 0.009 
1472.852 ± 0.006 


1.9 X lO""* 


0.34 



"Difference with model frequency assumes the m = - 1 component lies 
±0.4 fiHz from central frequency. 

'Difference with model frequency assumes m = —2 components lies 
±0.8 //Hz from central frequency. 

to minimize the observed posterior probability, /'(HOO|x), is reason- 
ably large (column 5). This highlights the truly resolved nature of 
the p mode profiles in the frequency-power spectra. Theoretically, 
the minimum value of p( HOO|x) will occur when R is equal to 
the linewidth of the mode l lApDourchaux et aT I l2009h . As can be 
seen in Table |3]i? does, in general, decrease with mode frequency. 
However, there are quite a few cases where this is not the case. This 
could be because we are looking for modes whose powers are only 
marginally greater than the background noise. Consequently, the 
mode profiles are difficult to distinguish from the noise, particularly 
in the limbs of the profiles and so the widths of the modes remain 
uncertain. The errors on the frequencies, given in Table [3] are the 
bin width of the smoothed spectra in which the minimum proba- 
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Figure 11. An echelle plot, modulo 145 /jHz, marking locations in fre- 
quency of occuiTences uncovered by the Bayesian test searches of smoothed 
spectra. The frequency-power spectra were smoothed over two, four, six, 
eight. 10, 12, 14, 16, 18 and 20 bins. The different symbols represent the 
different pairings of data that were searched (see legend). 



bilities were observed (i.e. 0.004^ //Hz). These error estimates are 
significantly larger than those given in Table |2] 

Although formally allowed by the uneven prior, we regard any 
detections that are further than 1 //Hz from the predicted frequen- 
cies with suspicion as, in general, the diff^erence is significantly less 
than this. Notice that there are two / = 1, « = 7, m = -1 candidates 
and two / = 0, ;i = 8 candidates. In each case the candidate that 
lies closest to the predicted mode frequency also has the lowest 
posterior probability and so these candidates are more likely to 
represent the actual modes. No g-modes candidates were detected 
regardless of the assumed width and power Once again we note 
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that the resuhs described in this section are dependent on the initial 
assumptions made when using the Bayesian approach. 



8 DISCUSSION OF THE RESULTS 

We begin by comparing the results observed in the unsmoothed 
spectra using frequentist and Bayesian search methods. The fre- 
quencies of candidates that are detected by both approaches 
show very good agreement with each other and w ith previously 



claim e d detections (see e.g 



Toutain et al 



1998 ; Bertello et aU 



200d iGarcia et al.l |200lL IChaplin et al.l 12002 : Broomhall et^ 



20071 : ISalabert et all I2OOS ). When the frequentist approach was 
applied, more false detections (i.e. detections that were found to lie 
away from the predicted mode frequencies) were observed than ex- 
pected. We believe this is significant and highlights the misleading 
nature of the frequentist approach. When the Bayesian approach 
was applied significantly fewer false detections were observed, 
even before the uneven prior probability was applied. Furthermore, 
the Bayesian approach allows us to use our knowledge of the Sun 
in a statistically rigourous manner to guide the frequencies at which 
we consider detections to be mode candidates (with the uneven 
prior probability). Although Bayesian methods may lead to more 
stringent threshold levels and consequently fewer detections we 
feel that this is a small price to pay for the improved accuracy 
and confidence one achieves in any detections. It is possible that 
using the uneven prior probability may discount detections that are 
in fact real mode signatures (type II errors). However, we feel that 
use of the uneven prior probability is justified as it also helps to 
minimize the number of noise signatures that are considered as 
mode candidates (type I errors). The Bayesian approach is also 
less open to misinterpretation than the frequentist approach. How- 
ever, once again we note that the Bayesian approach is reliant on 
the assumptions made when determining the posterior probability. 
Different assumptions may lead to different mode candidates being 
uncovered. For example, the assumption that the rotational splitting 
of g modes is 0.4 fiHz may be incorrect l lGarcfa et al.ll2007h . 

Mode candidates were also detected in smoothed spectra. This 
method was particularly successful at higher frequencies, where the 
modes have broader widths. It is worth noting that as the p-mode 
powers and widths were determined by extrapolation it is likely 
that the values used here are more accurate higher in the frequency 
search range. This may have influenced the results especially if the 
mode widths were overestimated. We were able to eliminate mode 
candidates that were positioned away from the predicted mode 
frequencies by employing the non-tmiform prior probability. This 
cannot be done for the frequentist approach as we are effectively 
using a priori knowledge to guide the regions in frequency we 
search. The use of a priori knowledge in a statistically rigourous 
manner can only be done through Bayesian statistics. 

The g-mode candidate that was observed in the frequentist ap- 
proach was not detected when Bayesian statistics were used. One 
g mode candidate was detected using the Bayesian approach. How- 
ever, each detection was only made in one combination of the data 
sets and neither candidate was detected with both the frequentist 
and Bayesian approaches. Therefore, we regard these two can- 
didates with suspicion and feel that each candidate still requires 
further investigation and independent confirmation. The candidate 
that was detected with the Bayesian approach was not detected 
when the spectra were smoothed. This could indicate that, if it is 
a g mode, it has a lifetime that is longer than the lifetimes assumed 
here. 



ACKNOWLEDGEMENTS 

The authors thank S. Basu and A. Serenelli for the solar models that 
were used to produce Fig.|2] We would like to thank all members of 
the Phoebus collaboration for much valued discussions concerning 
this work. This paper utilizes data collected by the Birmingham 
Solar-Oscillations Network (BISON). We thank the members of the 
BISON team, both past and present, for their technical and analyt- 
ical support. We also thank P. Whitelock and P. Fourie at SAAO, 
the Carnegie Institution of Washington, the Australia Telescope 
National Facility (CSIRO), E.J. Rhodes (Mt. Wilson, Californa) 
and members (past and present) of the lAC, Tenderize. BiSON is 
funded by the Science and Technology Facilities Council (STD). 
The authors also acknowledge the financial support of STD. We 
thank the referee for insightful comments. 



REFERENCES 

Appourchaux T., 2004, A&A, 428, 1039 

Appourchaux T., 2008, Astron. Nachi'., 329, 485 

Appourchaux T., et al., 2000, ApJ, 538, 401 

Appourchaux T., et al., 2010, A&AR, 18, 197 

Appourchaux T., Samadi R., Dupret M., 2009, A&A, 506, 1 

Bahcall J. N., Serenelli A. M., Basu S., 2006, ApJS, 165, 400 

Belkacem K., Samadi R., Goupil M. J., Dupret M. A., Brun A. S., Baudin 

F.,2009, A&A, 494, 191 
Berger J., Sellke T., 1987, J. American Statistical Association, 82, 112 
Bertello L., et al., 2000, ApJ, 535, 1066 

Broomhall A. M., Chaplin W. J., Elsworth Y., Appourchaux T.. 2007, 
MNRAS, 379, 2 

Broomhall A. M., Chaplin W. J., Elsworth Y, Fletcher S. T., 2008, Astron. 
Nachr., 329, 461 

Chaplin W., Elsworth Y, Isaak G., Marchenkov K., Miller B., New R., 

Pinter B., 2002, MNRAS, 336, 979 
Chaplin W. J., Elsworth Y, Howe R., Isaak G. R., McLeod C. R, Miller 

B. A., New R., 1997, MNRAS, 287, 51 
Chaplin W. J., Elsworth Y, Isaak G. R.. Miller B. A.. New R., 1999, 

MNRAS, 308, 424 
Chaplin W. J., Serenelli A. M., Basu S., Elsworth Y, New R., Verner G. A., 

2007, ApJ, 670, 872 
Christensen-Dalsgaard J., Berthomieu G., 1991. Solar interior and atmo- 
sphere. Univ. of Arizona Press, Tucson, AZ, pp 401^78 
Elsworth Y, Howe R., Isaak G. R., McLeod C. P, Miller B. A., New R., 

Speake C. C, Wheeler S. J., 1994, MNRAS, 269, 529 
Fletcher S. T., Chaplin W. J., Elsworth Y, Schou J., Buzasi D., 2006, 

MNRAS, 371, 935 
Gabriel A. H., et al., 2002, A&A, 390, 1119 
Garcia R. A., et al., 2001, Sol. Phys., 200, 361 
Garcia R. A., et al., 2004, Sol. Phys., 220, 269 
Garcia R. A., et al., 2005, A&A, 442, 385 
Garcia R. A., et al., 2008, Astron. Nachr., 329, 476 

Garcia R. A., Mathur S., Ballot J., Eff-Darwich A., Jimenez-Reyes S. J., 

Korzennik S. G., 2008, Sol. Phys., 251, 119 
Garcia R. A., Turck-Chieze S., Jimenez-Reyes S. J., Ballot J., Palle P. L., 

Eff-Darwich A., Mathur S., Provost J., 2007, Sci., 316, 1591 
Houdek G., Balmforth N. J., Christensen-Dalsgaard J., Gough D. O., 1999, 

A&A, 351,582 

Mathur S., Eff-Darwich A., Garcia R. A., Turck-Chieze S., 2008, A&A, 
484,517 

Mathur S., Turck-Chieze S., Couvidat S., Garcia R. A.. 2007, ApJ, 668, 
594 

Provost J., Berthomieu G., Morel P, 2000, A&A. 353. 775 

Salabert D., Leibacher J., Appourchaux T., Hill R, 2009, ApJ, 696, 653 

Scherrer P H., et al., 1995, Sol. Phys., 162, 129 

Sellke T., Bayarri M., Berger J., 2001, The American Statistician, 55, 62 
Sturrock P A., 2008, arXiv:0809.0276vl 



Statistical searches for low-frequency solar oscillations 



Toutain T., Appourchaux T., Frohlich C, Kosovichev A. G.. Nigam R., 

Scherrer P. H., 1998, ApJ, 506, L147 
Turck-Chieze S., et al„ 2001, ApJ, 555, L69 
Turck-Chieze S., et al., 2004, ApJ, 604, 455 

Zaatri A., Provost J., Berthomieu G., Morel P, Corbai'd T., 2007, A&A, 
469, 1 145 



16 A. -M. Broomhall et al. 

APPENDIX A: TABLE OF ASSUMPTIONS 
Table Al. Assumptions made with each approach. 



Property 



Frequentist 



Bayesian 



Underlying statistics Gaussian 

(for frequency-amplitude spectra) 



Threshold probability 
Rotational splitting 
Power of p modes 



0.01 
0.4^Hz 
None 



2d.o.f 

(for frequency-power spectra) 

0.01 

0.4 ^iHz 

Based on extrapolations 
(see Section l4~n : 

0.1 mm^ s"- < y2 < 38 mm^ s"^ 



Power of e modes 



None 



Based on models 
(see Section l4~n : 
y2 = 9 or36mm2 s" 



Frequencies considered 
as p-mode candidates 



Closer than 1 ^Hz 
to a model mode frequency 



Allowed by uneven prior probability 
i.e. an inverted top-hat distribution, 
based on frequencies predicted by 
5000 models 



Frequencies considered 
as g-mode candidates 



Lifetimes of p modes 



Closer than 1 juHz 
to a model mode frequency 



Long enough for the power to 
be confined to a few bins only 



Allowed by uneven prior probability 
i.e. an inverted top-hat distribution, 
based on frequencies predicted by 
models and an uncertainty of 1 per cent 

Based on extrapolations 
(see Section l4?Tt 
795 d > T > 35 d 



Lifetimes when searching 
for g-mode single spikes 



Greater than the length of 
the time series i.e. t > 307 1 d. 



Greater than the length of 
the time series i.e. r > 3071 d 



Lifetimes when searching 1, 2, 3, and 4 months 1, 2, 3, and 4 months 

for g-mode peaks 



